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DYNAMIC RESOURCE ALLOCATION USING 
PROJECTED FUTURE BENEFITS 

CROSS REFERENCE TO RELATED APPLICATION 

The subject matter this application is related to application Serial No. 

5 09/ , filed concurrently herewith by Tracy Kimbrel et al. for "Dynamic 

Resource Allocation Using Known Future Benefits" (IBM Docket 
YOR9-2000-0824US1). The subject matter of application Serial No. 
09/ , is incorporated herein by reference. 

p DESCRIPTION 

BACKGROUND OF THE INVENTION 

Field of the Invention 

The present invention generally relates to benefit task systems and, 
more particularly, to a policy for allocating resources to maximize some 
benefit. The invention may be applied to a variety of problems, and the benefit 
may be either tangible (e.g., profit) or intangible (e.g., customer satisfaction). 
In a specific example, the invention has particular application to server 
allocation in a Web site server "farm" given limited information regarding 
future loads to maximize profits for the Web hosting service provider. In 
another specific example, the invention can be applied to the allocation of 
telephone help in a way to improve customer satisfaction. In yet another 
example, the invention may be applied to distributed computing problems 
where the resources to be allocated are general purpose computers connected 
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in a network and used to solve computationally intensive problems. 

Background Description 

Web content hosting is an important emerging market. Data centers 
and Web server "farms" are proliferating. The rationale for using such centers 
5 is that service providers can benefit from economies of scale and sharing of 
resources among multiple customers. This benefit in turn translates to lower 
cost of maintenance for the customers who purchase these hosting services. 
Web content hosting services are structured in many ways. One of the most 
prevailing ways is outsourcing: the customers deliver their Web site content in 

10 response to HTTP (hyper text transfer protocol) requests. Service providers 
will use "farms" of commodity servers to achieve this goal. 

One of the components in the payment for such a service is "pay per 
served request". Thus, one of the main objectives of the service provider is to 
maximize the revenue from served requests while keeping the tab on the 

15 amount of resources used. Ideally, the allocation to a Web site should always 

suffice to serve its requests. However, due to a limited number of servers and 
the overhead incurred in changing the allocation of a server from one site to 
another, the system may become overloaded, and requests may be left 
unserved. Under the assumption that requests are not queued, a request is lost 

20 if it is not served at the time it is requested. The problem faced by the Web 
hosting service provider is how to utilize the available servers in the most 
profitable way. What adds to the complexity of this problem is the fact that 
future load of the sites is either unknown or known only for the very near 
future. A Web content hosting service provider therefore needs a procedure to 

25 dynamically allocate servers in a server "farm" to its customers' Web sites. 

Similar considerations apply in the cases of computer servers and 
telephone support centers. Telephone support centers typically are computer 
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controlled telephone networks having a number of technical support, order 
support and customer service support operators. These operators are resources 
that must be allocated to customers who call in. Computer software is used to 
answer telephone calls and direct the calls to the appropriate pool of operators. 
In this application, the operators are the resources to be allocated. The wait 
time that a customer experiences is inversely proportional to customer 
satisfaction and, therefore, it is important to be able to dynamically allocate 
resources in such a manner as to minimize customer wait time and increase 
customer satisfaction. In this application, customer benefit is the intangible 
benefit which is sought to be maximized. 

In yet another example, the resources to be allocated are general 
purpose computers used to solve computationally intensive problems. In this 
environment, multiple computers can be used concurrently to solve a problem 
faster than a single computer can solve it. The computers would be connected 
in a network which may include the Internet. It has even been proposed that 
personal computers connected to the Internet might constitute resources that 
could be employed in solving such problems. It is anticipated that a market for 
such services will become standardized to some extent, so that the computer 
cycles become a commodity (resource) available from multiple vendors. It 
would be an advantage to dynamically allocate such resources. 

SUMMARY OF THE INVENTION 

It is therefore an object of the present invention to provide a method 
for server allocation given limited information regarding future requirements. 

It is another object of the invention to provide a method for server 
allocation to achieve close to the greatest possible benefit based on the 
assumption that benefit is proportional to the utilization of resources. 

According to the invention, the method of resource allocation uses an 



4 

approach of "discounting the future". Specifically, when the policy faces the 
choice between a guaranteed benefit immediately and a potential benefit in the 
future, the decision is made by comparing the guaranteed benefit value with a 
discounted value of the potential future benefit. This discount factor is 
5 exponential in the number of time units that it would take a potential benefit to 

be materialized. The future benefits are discounted because by the time a 
benefit will be materialized, things might change and the algorithm might 
decide to make another choice for a potential (even greater) benefit. 

In the practice of the invention, the resource allocation problem is 
10 modeled mathematically. In the model, time is divided into intervals. For the 

Web server farm problem, the assumption is made that each site's demand is 
2 uniformly spread throughout each such interval. Server allocations remain 

□ fixed for the duration of an interval. It is also assumed that servers are 

O 

u reallocated only at the beginning of an interval, and that a reallocated server is 

+: 15 unavailable for the length of the interval during which it is reallocated. This 

s represents the time to "scrub" the old site (customer data) to which the server 

U | was allocated, to reboot the server and to load the new site to which the server 

Gf has been allocated. The length of the time interval is set to be equal to the non- 

Q negligible amount of time required for a server to prepare to serve a new 

20 customer. In current technology, this time is in the order of 5 or 1 0 minutes. 

Each server has a rate of requests it can serve in a time interval. For 
simplicity, all rates are assumed to be identical. Due to practical concerns 
(mainly security constraints placed by customers), sharing of servers at the 
same time is not allowed. That is, customers share servers only in the sense of 
25 using the same servers at different times, but do not use the same servers at the 
same time. Thus, even in case of overload, some of the servers may be 
underutilized if they are allocated to sites with rates of requests lower than the 
servers' rate. 
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Each customer's demand is assumed to be associated with a benefit 
gained by the service provider in case a unit demand is satisfied. Given a fixed 
number of servers, the objective of the service provider is to find a time- 
varying server allocation that would maximize benefit gained by satisfying 
5 sites' demand. In a fully online implementation of the invention, only the 

demand of the current interval is known at the beginning of the interval. It is 
assumed that some amount of future demand is known to the service provider, 
and this is modeled by lookahead, which means that the demand of the sites is 
known for the next predetermined number of time intervals (following the 

10 current time interval). Lookahead requires that some forecasting mechanism is 
used to predict future demands. In the online setting, an online allocation 
algorithm is measured by its competitive ratio; that is, the maximum over all 
instances of the ratio of the benefit gained by the optimal offline allocation to 
the benefit gained by the online allocation on the same instance. Almost 

15 optimal algorithms (both deterministic and randomized) are used for any 
positive lookahead. The quality of the solution improves as the lookahead 
increases. 

Interestingly, the model can be cast as a more general benefit task 
system. In this task system, we are given a set of states for each time, and a 

20 benefit function. The system can be at a single state at each time, and the 

benefit for time / is a function of the system states at times t - 1 and /. The 
goal is to find a time varying sequence of states that maximizes the total 
benefit gained. That is, at each time t, we need to determine to which state 
should the system move (and this will be the state of the system at time t + 1), 

25 and we gain the benefit that is determined by the benefit function. Similar to 

the server farm model, in the fully online implementation of the invention, 
only the benefit of the current step is known at the time the next move is 
determined, and in case of lookahead, Z, the benefits of the next L steps (in 
addition to the current) are known. 

YOR9-2000-0831US1 
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It can be shown that benefit task systems capture also benefit 
maximization variants of well studied problems, such as the A>server problem 
(see A. Borodin and R. El-Yaniv in On-line Computation and Competitive 
Analysis, Cambridge University Press, 1998) and metrical task systems (see 
5 On-Line computation and Competitive Analysis, cited above). Thus, our 

results hold for these variants as well, and show that the benefit variants of 
these problems may be more tractable than their cost minimization variants. 



BRIEF DESCRIPTION OF THE DRAWINGS 

The foregoing and other objects, aspects and advantages will be better 
understood from the following detailed description of a preferred embodiment 
of the invention with reference to the drawings, in which: 

Figure 1 is a block diagram illustrating the architecture of a Web 
server farm; 

Figure 2 is a flow diagram illustrating the process of allocating servers 
using projected future benefits; and 

Figure 3 is a flow diagram illustrating the process of computing new 
allocations of servers. 

DETAILED DESCRIPTION OF A PREFERRED 
EMBODIMENT OF THE INVENTION 

20 Although the invention is described in terms of a specific application 

to a Web server farm, this explanation is by way of example only. It will be 
understood by those skilled in the art that the invention may be applied to 
other applications. Among those applications are the customer telephone 
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support problem and the allocation of computers to computationally intensive 
problems already mentioned. The Web server farm problem will serve to 
provide a concrete application of the invention which can be applied to other 
resource allocation problems. 
5 Referring now to the drawings, and more particularly to Figure 1 , there 

is shown, in generalized form, the architecture of a Web server farm of the 
type managed and maintained by a Web hosting service provider. The farm 
itself comprises a plurality of servers 10 1? 10 2 , 10 3 , . . ., 10„ connected to a 
server farm network 1 1 . The server farm network 1 1 connects the servers via 
10 the Internet 12 to a plurality of Web browsers 13 lf 13 2 , . . ., 13,. Customers of 
the Web hosting service provider purchase hosting services, which include 
maintaining the customers' Web sites so that the Web sites can be readily 
accessed by persons using the Web browsers. The Web hosting service 
provider seeks to maximize profits by allocation of resources, i.e., the servers. 
15 This is the function of the dynamic resource allocator 14 connected to the 
server farm network. 

Figure 2 illustrates the general process implemented by the dynamic 
0 resource allocator 14. The process begins in function block 21 where the 

current and forecasted per-customer demands and revenues are obtained. 
20 Next, current allocations of servers are obtained in function block 22, and then 
new allocations are computed in function block 23. The computed new 
allocations are compared with the current allocations in decision block 24 and, 
if they are different, then in function block 25, re-allocated servers are directed 
to serve their new customers before the process loops back to function block 
25 21. If the new allocations are not different from the current allocations, as 

determined in decision block 24, the process goes directly to function block 21 
to begin the process anew. 

The first function block 21, obtain current and forecasted per-customer 
demands and revenues, is outside the scope of the present invention. We 
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assume some forecasting mechanism is used to determine the projected 
demands and benefits. As will be explained in more detail, the present 
invention is the algorithm for deciding what the allocations should be based 
on a given forecast; i.e., function block 23, compute new allocations. 
5 Allocations are made based on the result of this computation. 

The Web Server Farm Problem 

Suppose that we are given s Web sites that are to be served by k Web 
servers. (For simplicity, we assume that all servers are identical.) Time is 
divided into units. It is assumed that the demand of a Web site is uniform in 
each time unit. Each server has a "service rate" which is the number of 
requests to a Web site each server can serve in a time unit. Without loss of 
generality, we normalize the demands by the service rate so that a server can 
serve one request per time unit and demands of a site may be fractional. A 
Web server can be allocated to no more than one site at each time unit and it 
takes a time unit to change the allocation of a server. 

A problem instance consists of the number of servers, k, the number of 
sites, 5, a non-negative benefit matrix, b u , denoting the benefit gained by 
serving a request of site i G [1 ...s] for time step t > 1, and a non-negative 
demand matrix, {d u } 9 denoting the number of requests at site i for time step t. 
The goal is to find for each site i E [l...s] a time varying allocation {a it } of 
servers, so as to maximize the total benefit, as follows. The allocation must 

s_ 

satisfy that for each t 9 a iJt <> k. Only a! i t = min { a it „ v a iJt } of the 
i=i 

servers allocated to site i for the time step t are "productive", i.e., actually 
serve requests. We get that the total benefit of an allocation {a l t } is 




20 
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E E K • 111111 { 4 >t > } + EE A y ,min { d ut> a uv vi }• 

fel z-1 fel 1=1 

In the online solution of the problem according to the present 
invention, we need to compute the allocation {a i t } at time t given the demands 
so far, i.e., {d it .} for all i and t'<t. It is not difficult to show that any fully 
online algorithm may perform arbitrarily badly with respect to the offline 
optimal solution. This is true because of the one time unit lag in a server's 
availability. Thus, we consider online algorithms with lookahead. In the online 
verison with lookahead L, at each time t we are additionally given the 
demands for times t+l, . . ., t+L, i.e., the entries d it > for i G [l„.s] and 
t+1 <t' <r+L, and we need to compute the allocation at time t, i.e., a lt for 

The Benefit Task System Problem 

The Web server farm problem is a special case of the generalized task 
system benefit problem. In this problem, we are given (i) a set of possible 
states U t9 for each time t>0 9 and (ii) a non-negative benefit function B whose 
domain is u^U^U^), that denotes, for each time t>Q, the benefit that is 
accrued (at time H-l) by the transition from a state U { to a state C/, +1 . The goal 
is to choose a state st for each time t so as to maximize the total benefit 

t 

In the online version of the problem with lookahead L, the state s t E U t 
should be computed based on knowing only U t * for t'<t+L 9 and the restriction 
of the function B to pairs of these sets of states. 

Observe that the Web server farm problem can be cast in this setting 
by identifying each possible allocation of servers to sites at time t with a state 
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5 „ and defining the benefit function B(S 4 „ S,, to be the benefit gained by 
changing the allocation at time t from the one represented by S, . , to the 
allocation at time t+\ represented by S Jt M . (In a sense, the set of states for all 
times is the same.) The number of states is exponential in the number of 
servers k, so the states and the benefit functions are implicit and follow from 
the more succinct representation of the Web server farm problem. For 
example, the values B(s „ s J: „,) are not listed explicitly, and any single value 
can be efficiently computed when necessary. 

Benefit maximization variants of well known problems can also be 
cast in this setting of a benefit task system. Consider the benefit version of the 
^-server problem with one difference. Instead of minimizing the cost of 
satisfying all requests, we define for each time / and any possible pair of server 
configurations, one at time t- 1 and one at time t, a net benefit gained by 
satisfying the request starting from the configuration at time t~ 1 and ending at 
the configuration at time t. (Note that configuration at time t must include at 
least one server at the point of the request.) This benefit has to be non- 
negative and is composed of a fixed positive component that is reduced by the 
cost to move from the configuration at time t- 1 to the configuration at time /. 
The goal in this case is to maximize the total benefit. It is not difficult to see 
that this problem can also be modeled by the benefit task system defined 
above and thus all our results apply to the benefit version of the ^-server 
problem. 

Similarly, consider the benefit version of a metrical task system. This 
version is similar to the classical metrical task system (see, for example, A. 
Borodin, N. Linial and M. E. Saks, "An optimal on-line algorithm for metrical 
task system", J. ACM, 39 (1992), pp. 745-763), with the difference that each 
task is associated with a vector of benefits, one for each state, such that the net 
benefit after subtracting the transition cost is non-negative. This model as well 
can be cast as a benefit task system and our results apply. 
-0831US1 
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Deterministic Online Algorithms for Benefit Task Systems 



Here, we describe two deterministic algorithms for benefit task 
systems with lookahead L. The first algorithm is a future discounting 



algorithm that achieves a competitive ratio of I V i+1 - Asymptotically 



5 in I, this ratio is 1 + . The second algorithm is an intermittent reset 

L 

algorithm that achieves a competitive ratio of 1 + 4 , which is better than 

(L- 7) 

the first algorithm when L is sufficiently large. 

The future discounting algorithm 

This is a deterministic algorithm for the benefit task system problem 

Li 



1 0 that achieves competitive ratio 



fL+l . The algorithm is based on 



discounting future benefits in an exponential way. Consider a strategy that 
collects L+l benefits b 0 , b x , ...,b L (in this order) in the next L+l time steps, 
we define its anticipated benefit to be b 0 + 6,/a + . . . +b L /a L (for a parameter 

I y 

a>l that we later choose as a = yjL + 1 ). 

1 5 The discounting algorithm greedily follows at each time step the 

strategy with the maximum anticipated benefit. That is, at every time step the 
algorithm finds a strategy for the next L+l time steps whose anticipated 
benefit is maximal and acts (in this time step) according to this strategy. We 
stress that such a calculation is made every time step, so the strategy chosen in 

20 the current time step does not have to agree with the one chosen in the 
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previous time step. 

We note that this algorithm requires that a strategy with the greatest 
anticipated benefit can be computed efficiently. The following lemma 
provides this requirement when the input describes the states explicitly. Note 
5 that we cannot apply this lemma to the Web server farm problem since here 

the states are implicit and of exponential size, so we use the offline algorithm 

described in copending patent application Serial No. 09/ , (IBM Docket 

YOR9-2000-0824US1). 

Lemma 1. A strategy with the largest anticipated benefit can be computed in 
1 0 time that is polynomial in the number of states. 

Proof. Use dynamic programming that goes interactively over the time steps 
and accumulate discounted benefits. As the additional benefit of the next time 
step depends only on the current state, it suffices to have a table of size 
proportional to the number of states. 

1 5 Theorem 2. The above discounting algorithm has competitive ratio at most 

Proof. Consider the online algorithm at time t, and denote by b 0 , b v . . ., b L the 
sequence of benefits in the strategy with the largest anticipated benefit over all 
strategies that the online algorithm has at time t. Define ON, = b 0 and 
20 ONl\ = b 1 /a+...+ b L /a L , then ON t + ON* + \ is the largest anticipated benefit 

over all the strategies that are available to the online algorithm at time t. Since 
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the online algorithm follows this strategy at time t, ON, is the benefit that the 
online collects for this time step. 

Fixing an offline algorithm arbitrarily, let OFF, denote the benefit that 

this offline algorithm collects at time t, and let OFF, 1 be a shorthand for 

5 OFF,+Va+. . .+OFF,Ja L . 

One strategy that is considered by the online algorithm at time t is to 
first join the offline algorithm (i.e., move to the location of the offline at time 
t+l), and then follow the offline algorithm for the next L steps. The 
anticipated benefit of this strategy is at least OFF, L . Since the online 

1 0 algorithm follows at each time step the strategy with the maximum anticipated 
benefit, we get that 

ON, + ON*\ z OFF t L +l (1) 

Another strategy that is considered by the online algorithm at time t is 
to follow the strategy that had the largest anticipated benefit at the previous 
1 5 time t- 1 . (Since the first step of that strategy has already taken at time t- 1 , 
we need to add an arbitrary move at the end.) The contribution of these 
benefits to the anticipated benefit at time t is larger by a factor of a than their 
contribution at time t- 1, and so the anticipated benefit of this strategy is at 
least aON* L . Since the online algorithm follows at each time step the 
20 strategy with the maximum anticipated benefit, we get that 

ON t + ONl\ > aON? 1 (2) 



Adding 



times inequality (1) and — times inequality (2), we get that 

a 



ON, + ONl\ :> ON; L + 



OFF, 1 :, (3) 
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Adding up inequality (3) over all time steps t (and assuming that the demands 
table starts and ends with L time units of zero demand), we get that 



i-i 

a 



a 



a ) 



E OFF,. 



The last equality follows since every benefit that the offline collects is 
accumulated with each of the discounts 1/a, . . ., lid 1 . We conclude that the 



competitive ratio of the algorithm is (choosing a = y/L+l ) 



11 OFF, 



a 



a-1 = = (1+1) 
a 1 -! a 
a £ (a-l) 



1 + 1 
L 



An intermittent reset algorithm 

This is a deterministic intermittent reset online algorithm that has 



10 competitive ratio l+O 



— for the benefit task system problem. This 



algorithm is motivated by the randomized algorithm described below that 
wastes one of every L+l steps in order to gain the optimal benefit between 
wasted steps. Here, we waste one in every &(L) steps, but the choice of steps 
to waste is done carefully in a deterministic fashion and the overall benefit 
1 5 that we gain is close to that of the randomized algorithm. 

The algorithm works in iterations each of length at least 1/2. Let s, 
denote the start time of iteration i. Each iteration i consists of four steps as 
follows: 

Step 1 : Consider all times t=s+L/2+\...,s+L. For each such time t, let x, 
20 denote the maximum benefit of any transition from a state at time t to a 

YOR9-2000-0831US1 
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state at time f+1. Let The the time t that minimizes x t , i.e., 
x r =min {x^+JL/2+1 < t<s+L] . 
Step 2: Compute the optimal sequence of transitions that starts at any state at 
time s,+l and ends at any state at time T. This can be done either by 

5 dynamic programming similar to Lemma 2 or by applying the offline 

algorithm above. (We remark that in the first iteration, the initial state 
is given and thus we may compute the optimal sequence of transitions 
from the initial state to any state at time T. The next step has to be 
modified accordingly.) 

1 0 Step 3 : Move from the current state at time ^ to the state at time s +1 that is 
the starting state of the optimal sequence computed above, and 
continue with the optimal sequence of moves to the state at time T. 
Step 4: Set the starting point s i+{ =T for the next iteration. 

Theorem 3. The above intermittent reset algorithm has competitive ratio at 
4 

15 most 1 + . 

1-7 

Proof. We assume I is a multiple of 4, and otherwise round down to the 
nearest multiple of 4 (in the expense of increasing the competitive ratio as 
explained later). We will call the starting times of the iterations "gaps". 
Denote the gaps by s l9 s 29 . . . We divide the transitions taken by the algorithm 

20 into two components, the moves taken in the gaps and the steps taken in the 
rest of the times. Consider the path taken by the online algorithm. It may not 
collect any benefit in the gaps, but since it computes the optimal paths 
between gaps (allowing arbitrary initial and terminal states adjacent to the 
gaps), clearly the benefit of the optimal offline between gaps is bounded by 

25 the benefit of the online algorithm between gaps. 
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Now we bound the benefit of the optimal algorithm in the gaps. 
Consider iteration i starting at si and computing T=s t +l. Consider the portion 
of the online path from s t +L/2+l to si+1. Denote the length of this portion by 
a, i.e., a=s /+1 -(^+Z,/2+l). Consider the portion of the path computed in the 
next iteration from s i+] +l to s+L+\ . Denote the length of this portion by 6, 
i.e., b=s i +L+l-(s i+l +l). By our choice of L, 1/2 is even and a+b=L/2- 1. Thus, 
one of a and b is even and the other is odd. Assume a is odd and b is even; the 
proof is similar in the opposite case. (We note that ideally, we would like both 
a and b to be even, but choosing a+b to be even may lead to a worse case 
when both are odd.) 

We claim that the benefit achieved by the online algorithm in these 
portions of the path at least x T -(a+b~ l)/2. This is true since in every time step 
in the range there is a transition of benefit x T and thus the online algorithm can 
achieve benefit at least x T on every other transition, even if it collects no 
benefit on the other half of the transitions. Since a is odd, it can collect at least 
x T -(a- 1)/2 in the first portion, and since b is even, it can collect at least x T 'b/2 
in the second. 

Putting these together along with a+b=L/2- 1, we have that the online 
benefit is at least 



a-l + b 
2 + 2) 



Xy * 



Since s+L+l is always less than ^ /+1 +I/2+l, these portions are disjoint among 



iterations. Summing over all iterations, we have ON ■ 



< 4 N 



i 



\L-4) 

where ON denotes the total benefit collected by the online algorithm and x Tj 
denotes the maximum benefit possible in the f th gap. Accounting for the 
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portions outside the gaps, we have ON z OFF- J} x T ^ where OFF denotes 

the optimal offline benefit. Putting these together and allowing for values of L 

4 

that are not multiples of 4 yields the desired bound of 1 + — - . 



Special Cases of the Server Allocation Problem 

5 We devise a deterministic online algorithm for the Web server farm 

problem with one server, two sites and lookahead Z=l. 

One server and two sites with lookahead one 

We devise a deterministic online algorithm for the Web server farm 
problem with one server, two sites and lookahead 1=1 . The algorithm has 

10 competitive ratio T=1+(J), where <|> = 1±JL * 1.618 is the golden ratio. This 

2 

ratio is nearly the best possible in this case, as we present a lower bound 

of hH. * 2.598 on the competitive ratio of any deterministic algorithm for 
2 

this problem. We remark that the competitive ratio that we obtain in this case 
is much better than in the case of arbitrary number of sites, where the best 

1 5 possible competitive ratio is 4. 

The algorithm is based on discounting future benefits. It is similar to 
the future discounting algorithm presented above (specialized to the case 1=1) 
with the exception that the discount factor is taken to be The analysis for 
this case adds some complication to that of the general case, in order to take 

20 advantage of the limited number of sites. For example, a discount factor of 2 
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optimizes the analysis of the section relating to the future discounting 
algorithm, and then the competitive ratio is shown to be at most 4. The actual 
competitive ratio in this case is 3, which can be shown by an analysis similar 
to the one that we give here. 

The following theorem states the improved competitive ratio for this 

case. 

Theorem 4. The discounting algorithm with discount factor (j) achieves 
competitive ratio r=l+(j)~2.618 for the web server farm problem with one 
server and two sites. 



10 Proof. Let us first describe the algorithm in more detail. We associate with 

0 each possible allocation its anticipated benefit, which combines the immediate 

benefit of this allocation and its future benefit. To compensate for the 
uncertainty of a future benefit (the algorithm might later decide to not collect 
these benefits), we discount it (with respect to an immediate benefit) by a 
15 factor of (J). In other words, an action that causes the algorithm to collect a 
benefit b and to have the potential to collect in the future a benefit b f has 
□ anticipated benefit of (|) ■ b + b ' . 

Suppose that the situation of the online algorithm at some time t is as 
described below. Assume, without loss of generality, the allocation of the 
20 server in the previous time unit is to the first site, as denoted by a triangle. 

Since the lookahead is L=l, the algorithm knows only the demands in the 
current time and in the next t+\ . 

\ >x x f ? ? . 



y ? ? ? 



The algorithm decides on the allocation of the server at time t as follows. If 
25 (j) • x + x > y 9 then the previous allocation is kept, i.e., the online algorithm 
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allocates the server at time / to the first site. Otherwise the algorithm changes 
the allocation and the server is allocated to the second site. Observe that this 
decision matches the largest anticipated benefit for the algorithm. If it decides 
to stay with the allocation as before, it collects a benefit of x and will have the 
5 option to collect at next time t+l an additional benefit x, yielding an 

anticipated benefit (j> • x + x. If the algorithm decides to change the allocation 
to the other site, then it collects no immediate benefit (at this point there is no 
way for the online algorithm to collect the benefit y), and in the next time t+l 
it will have the option to collect a benefit y', yielding an anticipated benefit 
10 <j)-0+/=y'. 
y. We will prove by induction on T that 

I r£ ON t+ q T+l - ON* +l > OFF t +OFF^ 

m HT t<.T 



G 



where ON, is the actual benefit collected at time t by the online 

algorithm; ON* is the potential benefit that the online algorithm will collect 

15 at time t if it remains allocated to the same site as in time /- 1 ; OFF t , OFF* 

are defined similarly for an arbitrary offline algorithm; and q, = <j> if at time 
t- 1 the online and the offline solutions have the same allocation (i.e., if they 
have the same "starting position" for time t), and q t = 1 otherwise. (Note that 
ON, is either ON* in case the potential benefit is gained or 0.) In a sense, this 

20 is a potential function analysis; see for example A. Borodin and R. El-Yaniv, 
On-line Computation and Competitive Analysis, Cambridge University Press 
(1998). 

Proving Equation (4) would complete the proof, since we can assume 
that the demands matrix is padded with zeros from both sides, and obtain for 
25 the last time Tthat r£ ON t k £ OFF v as required. 



YOR9-2000-0831US1 



20 

The base case of the induction follows from padding the demands 
matrix with zeros from both sides. For the inductive step, i.e., that the 
induction hypothesis (4) for time T follows from that of time T- 1, it suffices 
to show that 

5 r • ON T + q T+x • ON*+ x - q T • ON} z OFF T + OFF^ x - OFF; (5) 

We prove inequality (5) by verifying it over all cases. Denote the benefits at 
times T, T+1 by x, x 9 y 9 y' as above. Assume without loss of generality that at 
time T- 1 the server is allocated to the first site. The total number of cases is 
eight since there are two possible decisions for the online, and four possible 
1 0 decisions for the offline. 

Consider first the cases in which the allocation of the offline at times T 

and T+\ are the same. Then at time 7 the offline collects the benefit OFF* , 

and hence OFF t = OFF* . In addition ON* = x, so inequality (5) 
simplifies to 

15 r ■ tON T + q T+l • ON* +l - q T • x > OFF^ +1 

1 . q T - (J), q T+l = ({). Then both offline and online collect x. We need to show 
that 

r-x + ^-x'-cfr-x^x' 

i.e., that x + ((J>- l)x'> 0 which clearly holds. / 
20 2. q T = 1 , ^ r+l = 1 . Then offline collects y and online collects x. We need to 

show that 

r • x + x' - x * y' 
i.e., that (j) • x' > y ; , which holds since online keeps the previous 
allocation. / 

25 3 . q T = (j), # r+1 = 1 . Then offline collects x and online does not collect x. We 

need to show that 
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r • 0 + y - (J>-jc^x ; 
which holds since online changes allocation. / 

4. q T =l, q T+l = ((). Then offline collects y and online does not collect x. We 

need to show that 
5 r-O + fy-y'-xzy* 

i.e., that y' > l/((j)- 1) • x = (|) *x which holds since online changes 
allocation. / 

Consider now the cases in which the allocation of the offline at times T 
and T+l are different. Then the benefit of the offline at time Tis OFF T = 0. In 
10 addition ONj = x, so inequality (5) simplifies to 

r-ON T + q T+l • ON* +l -q T • jc * 0FF r * +1 - 6>FF r * 

5. q T = (j), = (j). Then offline and online both do not collect*. We need to 

show that 

r • 0 + 4> • y - $ • x > y f - x 
15 i.e., thaty > x, which clearly holds since online changes allocation. / 

6. q T = i 9 g T+l = i. Then offline does not collect^ and online does not collect x. 

We need to show that 

r • 0 + y ■ x i x' - y 
and since online changes allocation we indeed get 
20 y' > <j) • x + x' > x + x' - y. / 

7. q T =<J), q nl = 1 . Then offline does not collect x and online collects x. We 

need to show that 

r - x + x* - <|) • x ^ y - x 
i.e. that 2x + x ; > y which holds since online keeps the previous 
25 allocation. S 

8. fl r = 1, tf™ = 4>. Then offline does not collect >> and online collects x. We 

need to show that 
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i.e. that (j) • jc + ((j) - l)x' + y > 0 which clearly holds. / 



Multiple Servers with lookahead one 

We present an online algorithm for the web server farm problem with 
5 multiple servers and lookahead one. 

For the case of an even k, we describe below the algorithm and its 
analysis. In the case of an odd k>3, the algorithm is more involved. We 
remark that it applies the algorithm below (for an even number of servers) 
on the "first" k- 1 servers, and yet another algorithm for the additional server. 
10 For an even k, the algorithm splits the set of servers into two sets of 

equal size. The first set of servers is allocated so as to always collect the k/2 
biggest benefits (demands) on the odd time steps. These servers spend the 
even time steps in reallocation to those sites where they will be able to collect 
in the following time step, which is odd, the k/2 biggest benefits. On even 
1 5 time steps, these servers may collect zero benefit in the worst case. The 

Mr 

fy second set of servers is scheduled in a symmetric way. It collects the k/2 

biggest benefits on the even time steps and spend the odd time steps on 
reallocation to the best sites for the following time step which is even. We 
conclude that at each time t, the total benefit that the k servers collect is at 
20 least as large as the k/2 biggest benefits at this time t, which is clearly at least 

half the benefit that an optimal offline algorithm can collect at this time t. 
Therefore, the competitive ratio of this algorithm is at most 2. 

Theorem 5. There is a deterministic algorithm for the web server farm 
problem that achieves competitive ratio 2 when the number of servers k is 
25 even, and competitive ratio c k =2+2/(2k-l) when k>3 is odd.. 
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Proof. The case of an even k was already shown above. We thus deal below 
with the case that k is odd. 

For an odd £, the algorithm splits the servers into three groups. Two 
groups are of size (k - 1)/2 each, and the third group contains one server. The 
5 first group of servers collects the (k - 1)/2 biggest benefits on the odd time 

steps. The second group collects the (k - 1)/2 biggest benefits on the even time 
steps. Let F t denote the (k + l)/2 biggest benefit on step t. The last server tries 
to collect F t on each time step, as follows. On the step t the server either 
collects F t or it moves to the web site q such that d q t+ j = F m . The server 

1 0 collects F t if 2F t > F t+X ; otherwise, the server moves to the web site g. If the 
server collects F t on some step U it wastes the step ^1 on a move to the 
(k + l)/2'th best web site q for the step t+2, i.e., d q t+2 = F t+2 . 

For the sake of analysis we split time into phases: the first phase starts 
at the time step 0 (by convention we assume that benefits at this time step are 

15 equal to zero), phase ends at the time step t+1 if the server from the third 

group collects Ft on step t, the next phase starts at step t+2 and so on. Notice 
that the server from the third group always starts the phase allocated at the 
(k + l)/2'th best web site. Consider the first phase. We claim 

that F t > —52 F r Indeed > 2F i * F »i far / = 1, . . J - 1 and 2F t > F M ; 
20 therefore, 

i=i «=i 2 ,_ ' j=o y 

For p <, k, if S(z,p) is a sum ofp best benefits on step T, 
then Y, s (*> P) a -X) 0FF i md 
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T = l 1=1 \ 

r 3(fc-l) + - l 



2 , 



EOFF r = 2^-OFF = c^OFF. 



The same proof works for any phase and therefore we proved that our 
algorithm is c r competitive for odd k >3. 

Randomized Online Algorithms 

5 In this section we derive tight bounds on the competitive ratio of 

randomized online algorithms with lookahead L. These bounds are stated in 
the following theorem We remark that our randomized algorithm requires only 
0(log L) random bits, independent of the input length. 

Theorem 6. There is a randomized online algorithm with competitive ratio 
10 1+1/L. Furthermore, no randomized algorithm achieves a lower competitive 

ratio. 

Proof. The idea behind the algorithm is that with lookahead L, we can 
compute an optimal path for L time steps. However, we may have to make a 
poor move, in which we collect little or no benefit, in order to reach the first 
1 5 state on this path. After following this path, we can again choose the best 

state to move to during the next time step, possibly without collecting any 
benefit at all, but ensuring the best starting point for the next L steps. We lose 
only a 1/(1+1) fraction of the available benefit over each L+l time steps. We 
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refer to this as the "resetting" algorithm, since it operates in stages, after each 
of which it resets to an optimal state for the upcoming stage. The online 
algorithm will collect at least as much benefit as the optimal offline algorithm 
on every step except the resetting steps. Notice that there are L+\ possible 
5 "phases", i.e., phase j means (potentially) giving up all benefit during steps 

i • (L+l) + j for each i, where 0 <j < L. Randomizing over these choices, the 
online algorithm loses in expectation only a 1/(Z+1) fraction of the offline 

benefit, and the competitive ratio is at most = 1 + y . 

For any e > 0, we demonstrate a lower bound of (ZrH)/(Z+6) for the 
10 one-server problem. Our randomized adversary generates a demand matrix di,t 

with fl/e~| rows and L+2 columns (which can be repeated indefinitely), and a 
% revenue matrix of all ones. The first column of the demand matrix contains all 

0 zeroes, the next L columns contain all ones, and exactly one randomly chosen 

row in the last column contains a one and the rest contain zeroes. The optimal 
15 choice is for the server to move during the first step to the row containing the 

one in column 1+2, and to stay in that row for L+\ steps, collecting a total 
benefit of L+l. Any randomized online strategy stands only an 6 chance of 
choosing this row, and with probability at least 1-6 must either miss the 
benefit in this column, or forgo benefit in some earlier column in order to 
20 collect it. Thus, the expected benefit collected by the randomized online 

algorithm is at most 

Returning now to the drawings, the computation of new allocations 
(function block 23 in Figure 2) is shown in Figure 3. The process begins with 
an initialization step in function block 31 where t is set to the current time. In 
25 function block 32, the discounted revenues D(c, t+i) are computed for each 

customer c and each time period from current time t through current time plus 



ru 
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lookahead M-Z, using actual revenues R(c, t+i) and discount factor alpha: 

a 

Based on this computation, the optimal allocations from the current time 
through current time plus lookahead are found, starting from the current 
allocation, in function block 33. Finally, in function block 34, allocations and 
transitions in allocations for the next time period from t to £+1 are returned. 

While the invention has been described in terms of a preferred 
embodiment, those skilled in the art will recognize that the invention can be 
practiced with modification within the scope of the appended claims. 
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